Multi-armed bandit - PDFSEARCH.IO - Document Search Engine

Multi-armed bandit
Results: 113

#	Item
91	Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling Source URL: people.csail.mit.edu Language: English - Date: 2005-11-02 21:38:45 Game theory Cybernetics Machine learning Search algorithms Learning Reinforcement learning Markov decision process Multi-armed bandit Algorithm Statistics Mathematics Applied mathematics
92	Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling Source URL: www.machinelearning.org Language: English - Date: 2008-12-01 11:15:56 Game theory Cybernetics Machine learning Search algorithms Learning Reinforcement learning Markov decision process Multi-armed bandit Algorithm Statistics Mathematics Applied mathematics
93	Unimodal Bandits [removed] Jia Yuan Yu Ecole Normale Supérieure, HEC Paris, CNRS, France. Shie Mannor Source URL: www.icml-2011.org Language: English - Date: 2011-06-01 14:49:36 Stochastic optimization Search algorithms Unimodality Real analysis Convex analysis Multi-armed bandit Normal distribution Markov decision process Mode Statistics Mathematical analysis Mathematics
94	Multi-armed Bandit Formulation for Autonomous Mobile Acoustic Relay Adaptive Positioning Mei Yi Cheung, Joshua Leighton, Franz S. Hover Abstract: We apply the stationary multi-armed bandit (MAB) formalism to the proble Source URL: web.mit.edu Language: English - Date: 2013-07-15 22:25:46 Gittins index Multi-armed bandit Underwater acoustic communication Normal distribution Modem Variance Statistics Decision theory Design of experiments
95	Social User Agents for Dynamic Access to Wireless Networks P. Faratin and G. Lee and J. Wroclawski S. Parsons Laboratory for Computer Science Source URL: groups.csail.mit.edu Language: English - Date: 2010-02-11 14:14:32 Markov processes Stochastic control Information systems Reinforcement learning Markov decision process Multi-armed bandit Q-learning Preference elicitation Machine learning Statistics Dynamic programming Artificial intelligence
96	An adaptive algorithm for finite stochastic partial monitoring Gábor Bartók [removed] Source URL: icml.cc Language: English - Date: 2012-06-07 13:20:54 Game theory Artificial intelligence Search algorithms Control theory Game artificial intelligence Minimax Regret Observability Multi-armed bandit Statistics Decision theory Mathematics
97	Journal of Economic Theory 101, 252–280 [removed] doi:[removed]jeth[removed], available online at http://www.idealibrary.com on Learning While Searching for the Best Alternative Klaus Adam European University Institute, Via Source URL: adam.vwl.uni-mannheim.de Language: English - Date: 2008-09-23 18:11:18 Multi-armed bandit Expected value Decision theory Search theory Secretary problem Statistics Stochastic optimization Machine learning
98	A modern Bayesian look at the multiarmed bandit Source URL: www.economics.uci.edu Language: English - Date: 2011-03-31 14:42:20 Machine learning Multi-armed bandit Stochastic optimization Decision theory Gittins index Reinforcement learning Bandit Kullback–Leibler divergence Probability distribution Statistics Design of experiments Statistical theory
99	Journal of Machine Learning Research [removed] Submitted 4/00; Published [removed] Algorithms for the multi-armed bandit problem Volodymyr Kuleshov Source URL: www.cs.mcgill.ca Language: English - Date: 2010-12-24 03:47:38 Mathematical optimization Machine learning Multi-armed bandit Cybernetics Reinforcement learning Algorithm Greedy algorithm Dynamic treatment regime Statistics Mathematics Stochastic optimization
100	Reducing Dueling Bandits to Cardinal Bandits Nir Ailon Technion, Dept. of Computer Science, Haifa 32000, Israel NAILON@TECHNION.AC.IL Source URL: jmlr.org Language: English - Date: 2014-06-18 10:58:08 Stochastic optimization Mathematical analysis Machine learning Multi-armed bandit Algorithm Polylogarithm Regret Statistics Mathematics Decision theory
